Careers
←Job Openings
Job Description:
- Design & document the ETL process of the data flow
- Analyse & architect the tools and technical requirements for implementation
- Write hive queries to fetch the daily load of Client data which is in 16 tables and get a single file ready to export
- Reduce the IO operations on the YARN side as the data movement is very huge between the tables as the data in tables may spawn upto 7TB
- Implement hive optimization techniques to reduce the data querying time
- Write shell scripts to automate the complete ETL process
- Write UDF’s in JAVA to encrypt & decrypt the PII data in runtime
- Use SFTP to securely transfer the data to omnicell
- Schedule and run the job on a daily basis using concord jobs
- Automate the CI/CD process using Jenkins
- Document the best practices
- Deploy the application to higher environments Test & Prod
- Create CDH cluster for dev environment and provide access to different teams
- Use kerberos authentication to submit jobs to YARN
- Use TEZ as hive execution query engine to effectively process the data in memory
- Responsible for the whole ETL process
Required Skills:
- Java, Scala, Python, Shell, SQL, Hive, Tez, Mapreduce, Spark, HDFS, Concord, Git, Maven, Jenkins, Windows, Linux, Cassandra, Hbase
If you are interested in working in a fast-paced, challenging, fun, entrepreneurial environment and would like to have the opportunity of being a part of this fascinating industry, Please send resume to Bharath Bommareddy, President, HSTechnologies, L.L.C., 2801 West Parker Road, Suite 5, Plano, TX 75023.or email your resume to hr@sbhstech.com.
This position qualifies for employee referal bonus. Bonus to be paid is $1,000 if the referal is hired.